NSF PAR Search | NSF Public Access Repository

Enumerative theory for the Tsetlin library

https://doi.org/10.1016/j.jalgebra.2023.08.009

Chatterjee, Sourav; Diaconis, Persi; Kim, Gene B (October 2024, Journal of Algebra)

Full Text Available
Promoting Comprehension and Engagement in Introductory Data and Statistics for Blind and Low-Vision Students: A Co-Design Study

https://doi.org/10.1145/3706598.3713333

Fan, Danyang; Tomassetti, Olivia; Mouallem, Aya; Kim, Gene S-H; Patel, Shloke Nirav; Hwang, Saehui; Leader, Patricia; Sugrue, Danielle; Chen, Tristen; Ou, Darren Reese; et al. (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
Tangible Stats: An Embodied and Multimodal Platform for Teaching Data and Statistics to Blind and Low Vision Students

https://doi.org/10.1145/3613905.3650793

Fan, Danyang; Kim, Gene S-H; Tomassetti, Olivia; Patel, Shloke Nirav; O'Modhrain, Sile; Lee, Victor R; Follmer, Sean (May 2024, ACM)

Interactive data learning tools provide explorable ways for students to build intuitions about data, data representations, and statistical parameters. However, these tools rely on visual consumption and are not accessible to blind and low vision (BLV) students. In this work, we investigate opportunities to leverage active exploration, enriched with multimodal feedback and embodied interaction, to foster an understanding of the relationships among individual data values, data representations, and statistical measures. We explore these opportunities in the form of an accessible learning platform that allows students to hear and feel how statistical measures are changing in real time as they construct and manipulate physicalized data representations. We introduced the platform to four teachers of students with visual impairments (TVIs) through a two-hour-long focus group. TVIs embraced the platform’s exploratory nature and universality and recommended the consideration of additional auditory and texture-based interactions to enhance engagement.
Full Text Available
Semantically informed data augmentation for unscoped episodic logical forms

Juvekar, Mandar; Kim, Gene; Schubert, Lenhart (June 2023, 15th International Conference on Computational Semantics)

Unscoped Logical Form (ULF) of Episodic Logic is a meaning representation format that captures the overall semantic type structure of natural language while leaving certain finer details, such as word sense and quantifier scope, underspecified for ease of parsing and annotation. While a learned parser exists to convert English to ULF, its performance is severely limited by the lack of a large dataset to train the system. We present a ULF dataset augmentation method that samples type-coherent ULF expressions using the ULF semantic type system and filters out samples corresponding to implausible English sentences using a pretrained language model. Our data augmentation method is configurable with parameters that trade off sample plausibility against sample novelty and augmentation size. We find that the best configuration of this augmentation method substantially improves parser performance beyond using the existing unaugmented dataset.
Full Text Available
The Accessibility of Data Visualizations on the Web for Screen Reader Users: Practices and Experiences During COVID-19

Fan, Danyang; Siu, Alexa F.; Rao, Hrishikesh V.; Kim, Gene S-H; Vazquez, Xavier; Greco, Lucy; O’Modhrain, Sile; Follmer, Sean (March 2023, ACM transactions on accessible computing)

Data visualization has become an increasingly important means of effective data communication and has played a vital role in broadcasting the progression of COVID-19. Accessible data representations, however, have lagged behind, leaving areas of information out of reach for many blind and visually impaired (BVI) users. In this work, we sought to understand (1) the accessibility of current implementations of visualizations on the web; (2) BVI users’ preferences and current experiences when accessing data-driven media; (3) how accessible data representations on the web address these users’ access needs and help them navigate, interpret, and gain insights from the data; and (4) the practical challenges that limit BVI users’ access and use of data representations. To answer these questions, we conducted a mixed-methods study consisting of an accessibility audit of 87 data visualizations on the web to identify accessibility issues, an online survey of 127 screen reader users to understand lived experiences and preferences, and a remote contextual inquiry with 12 of the survey respondents to observe how they navigate, interpret, and gain insights from accessible data representations. Our observations during this critical period of time provide an understanding of the widespread accessibility issues encountered across online data visualizations, the impact that data accessibility inequities have on the BVI community, the ways screen reader users sought access to data-driven information and made use of online visualizations to form insights, and the pressing need to make larger strides towards improving data literacy, building confidence, and enriching methods of access. Based on our findings, we provide recommendations for researchers and practitioners to broaden data accessibility on the web.
Full Text Available
Supporting Accessible Data Visualization Through Audio Data Narratives

https://doi.org/10.1145/3491102.3517678

Siu, Alexa; Kim, Gene S-H; O'Modhrain, Sile; Follmer, Sean (April 2022, CHI '22: Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems)

Online data visualizations play an important role in informing public opinion but are often inaccessible to screen reader users. To address the need for accessible data representations on the web that provide direct, multimodal, and up-to-date access to the data, we investigate audio data narratives, which combine textual descriptions and sonification (the mapping of data to non-speech sounds). We conduct two co-design workshops with screen reader users to define design principles that guide the structure, content, and duration of a data narrative. Based on these principles and relevant auditory processing characteristics, we propose a dynamic programming approach to automatically generate an audio data narrative from a given dataset. We evaluate our approach with 16 screen reader users. Findings show that with audio narratives, users gain significantly more insights from the data. Users report that data narratives help them better extract and comprehend the information in both the sonification and description.
Full Text Available
The Accessibility of Data Visualizations on the Web for Screen Reader Users: Practices and Experiences During COVID-19

https://doi.org/10.1145/3557899

Fan, Danyang; Siu, Alexa F.; Rao, Hrishikesh V.; Kim, Gene S-H; Vazquez, Xavier; Greco, Lucy; O’Modhrain, Sile; Follmer, Sean (August 2022, ACM Transactions on Accessible Computing)

Data visualization has become an increasingly important means of effective data communication and has played a vital role in broadcasting the progression of COVID-19. Accessible data representations, on the other hand, have lagged behind, leaving areas of information out of reach for many blind and visually impaired (BVI) users. In this work, we sought to understand (1) the accessibility of current implementations of visualizations on the web; (2) BVI users’ preferences and current experiences when accessing data-driven media; (3) how accessible data representations on the web address these users’ access needs and help them navigate, interpret, and gain insights from the data; and (4) the practical challenges that limit BVI users’ access and use of data representations. To answer these questions, we conducted a mixed-methods study consisting of an accessibility audit of 87 data visualizations on the web to identify accessibility issues, an online survey of 127 screen reader users to understand lived experiences and preferences, and a remote contextual inquiry with 12 of the survey respondents to observe how they navigate, interpret and gain insights from accessible data representations. Our observations during this critical period of time provide an understanding of the widespread accessibility issues encountered across online data visualizations, the impact that data accessibility inequities have on the BVI community, the ways screen reader users sought access to data-driven information and made use of online visualizations to form insights, and the pressing need to make larger strides towards improving data literacy, building confidence, and enriching methods of access. Based on our findings, we provide recommendations for researchers and practitioners to broaden data accessibility on the web.
Full Text Available
A transition-based parser for unscoped episodic logical form

Kim, Gene Louis; Duong, Viet; Lu, Xin; Schubert, Lenhart (June 2021, Fourteenth Int. Conf. on Computational Semantics (IWCS 2021))
“Episodic Logic: Unscoped Logical Form” (EL-ULF) is a semantic representation capturing predicate-argument structure as well as more challenging aspects of language within the Episodic Logic formalism. We present the first learned approach for parsing sentences into ULFs, using a growing set of annotated examples. The results provide a strong baseline for future improvement. Our method learns a sequence-to-sequence model for predicting the transition action sequence within a modified cache transition system. We evaluate the efficacy of type grammar-based constraints, a word-to-symbol lexicon, and transition system state features in this task. Our system is available at https://github.com/genelkim/ulf-transition-parser. We also present the first official annotated ULF dataset at https://www.cs.rochester.edu/u/gkim21/ulf/resources/.
Full Text Available
Montague Grammar Induction

https://doi.org/10.3765/salt.v30i0.4816

Kim, Gene Louis; White, Aaron Steven (March 2021, Semantics and Linguistic Theory)
We propose a computational model for inducing full-fledged combinatory categorial grammars from behavioral data. This model contrasts with prior computational models of selection in representing syntactic and semantic types as structured (rather than atomic) objects, enabling direct interpretation of the modeling results relative to standard formal frameworks. We investigate the grammar our model induces when fit to a lexicon-scale acceptability judgment dataset, MegaAcceptability, focusing in particular on the types our model assigns to clausal complements and the predicates that select them.
Full Text Available
Montague Grammar Induction

https://doi.org/10.3765/salt.v30i0.4816

Kim, Gene Louis; White, Aaron Steven (March 2021, Proceedings from Semantics and Linguistic Theory)
We propose a computational modeling framework for inducing combinatory categorial grammars from arbitrary behavioral data. This framework provides the analyst fine-grained control over the assumptions that the induced grammar should conform to: (i) what the primitive types are; (ii) how complex types are constructed; (iii) what set of combinators can be used to combine types; and (iv) whether (and to what) the types of some lexical items should be fixed. In a proof-of-concept experiment, we deploy our framework for use in distributional analysis. We focus on the relationship between s(emantic)-selection and c(ategory)-selection, using as input a lexicon-scale acceptability judgment dataset focused on English verbs’ syntactic distribution (the MegaAcceptability dataset) and enforcing standard assumptions from the semantics literature on the induced grammar.
Full Text Available
